Temporal Relational Modeling with Self-Supervision for Action Segmentation

Abstract

Temporal relational modeling in video is essential for human action understanding, such as action recognition and action segmentation. Although Graph Convolution Networks (GCNs) have shown promising advantages in relation reasoning on many tasks, it is still a challenge to apply graph convolution networks to long video sequences effectively. The main reason is that the large number of nodes (i.e., video frames) makes it hard for GCNs to capture and model temporal relations in videos. To tackle this problem, in this paper, we introduce an effective GCN module, the Dilated Temporal Graph Reasoning Module (DTGRM), designed to model temporal relations and dependencies between video frames at various time spans. In particular, we capture and model temporal relations via constructing multi-level dilated temporal graphs where the nodes represent frames from different moments in the video. Moreover, to enhance the temporal reasoning ability of the proposed model, an auxiliary self-supervised task is proposed to encourage the dilated temporal graph reasoning module to find and correct wrong temporal relations in videos. Our DTGRM model outperforms state-of-the-art action segmentation models on three challenging datasets: 50Salads, Georgia Tech Egocentric Activities (GTEA), and the Breakfast dataset. The code is available at https://github.com/redwang/DTGRM.
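The idea of a dilated temporal graph can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (their code is at the repository linked above); it only assumes that, at dilation d, frame t is linked to itself and to frames t - d and t + d, and that one reasoning step is a plain row-normalized graph convolution. The function names, the dilation schedule (1, 2, 4), and the residual connection are hypothetical choices made for illustration.

```python
import numpy as np


def dilated_temporal_adjacency(num_frames: int, dilation: int) -> np.ndarray:
    """Row-normalized adjacency linking frame t to t - dilation, t, and t + dilation.

    Assumption for illustration: the abstract does not specify the edge rule.
    """
    adj = np.eye(num_frames)
    idx = np.arange(num_frames - dilation)  # empty when dilation >= num_frames
    adj[idx, idx + dilation] = 1.0
    adj[idx + dilation, idx] = 1.0
    return adj / adj.sum(axis=1, keepdims=True)


def graph_reasoning_step(features: np.ndarray, adj: np.ndarray,
                         weight: np.ndarray) -> np.ndarray:
    """One graph-convolution step: ReLU(adj @ features @ weight)."""
    return np.maximum(adj @ features @ weight, 0.0)


def multi_level_reasoning(features: np.ndarray, weights, dilations=(1, 2, 4)) -> np.ndarray:
    """Stack reasoning steps over graphs with growing dilation (hypothetical schedule)."""
    out = features
    for dilation, weight in zip(dilations, weights):
        adj = dilated_temporal_adjacency(out.shape[0], dilation)
        # Residual connection: an assumption, used here to keep per-frame features.
        out = out + graph_reasoning_step(out, adj, weight)
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    frames = rng.normal(size=(16, 8))  # 16 frames, 8-dimensional features
    weights = [rng.normal(scale=0.1, size=(8, 8)) for _ in range(3)]
    print(multi_level_reasoning(frames, weights).shape)
```

Each level adds only a few edges per frame, so the graph stays sparse while larger dilations let information travel between frames that are far apart in time.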

Similar resources

Temporal Subspace Clustering for Unsupervised Action Segmentation

Action segmentation (segmenting a continuous sequence of motion data into a set of actions) has a wide range of applications and plays a role in many problems in computer vision. We look at subspace clustering as an unsupervised approach for this task. Classical subspace clustering methods uncover relationships within the data by learning codes for the samples (i.e. frames), but in this process...

Modeling Temporal Crowd Work Quality with Limited Supervision

While recent work has shown that a worker’s performance can be more accurately modeled by temporal correlation in task performance, a fundamental challenge remains in the need for expert gold labels to evaluate a worker’s performance. To solve this problem, we explore two methods of utilizing limited gold labels, initial training and periodic updating. Furthermore, we present a novel way of lea...

Temporal Human Action Segmentation via Dynamic Clustering

We present an effective dynamic clustering algorithm for the task of temporal human action segmentation, which has comprehensive applications such as robotics, motion analysis, and patient monitoring. Our proposed algorithm is unsupervised, fast, generic to process various types of features, and applicable in both the online and offline settings. We perform extensive experiments of processing d...

Causal Modeling for Supervision

Today, process control and monitoring are evolving to include plant safety and availability management, on-line diagnosis and maintenance policy. Human operators are at the highest hierarchical level in the organization of the control system. The aims of supervision are to assist control operators in their decision-making tasks, to help them understand and identify operating situations underway in a ma...

Learning Semantic Segmentation with Diverse Supervision

Models based on deep convolutional neural networks (CNNs) have significantly improved the performance of semantic segmentation. However, learning these models requires a large amount of training images with pixel-level labels, which are very costly and time-consuming to collect. In this paper, we propose a method for learning CNN-based semantic segmentation models from images with several types o...

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2021

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v35i4.16377